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1 Introduction 



Nonlinear AR-ARCH models, i. e. models with an autoregressive conditional mean function 
and an autoregressive conditional variance function which are both not assumed linear, have 
become increasingly popular. They are also called CHARN (conditional heteroscedastic au- 
toregressive nonlinear) models. In this paper we assume an AR-ARCH model of order one. 
Our aim is to test for Gaussian distribution of the innovations, which constitutes a typical 
assumption in the modelling of time series data. Under the normality assumption, asymp- 
totic results often simplify. For instance, then no fourth moments appear in the asymptotic 
variance matrix of the empirical autocovariances of linear processes, see e. g. Brockwell & 
Davis (2006), Proposition 7.3.3. Further, the lack-of-fit test for ARCH models by Horvath, 
Kokozka & Teyssiere (2004) strongly depends on the assumption of Gaussian innovations. 
Araveeporn (2011) uses the assumption of Gaussian innovations in order to estimate the 
conditional mean and volatility functions. Furthermore, estimation of conditional quantiles 
is of great importance in the context of financial time series. Starting from a nonpara- 
metric AR-ARCH model, however, the quantiles of the innovation distribution need to be 
estimated, or better be known (see Franke, Kreifi & Mammen (2009), section 4). Moreover, 
under Gaussianity of the innovations one can derive asymptotically distribution-free versions 
of other specification tests; see the discussion below. 

We suggest a completely nonparametric test, which does not assume any parametric 
assumption on either mean or volatility function, but applies kernel estimators for those 
functions (see Doukhan & Ghindes (1983), Robinson (1983), Masry & Tj0stheim (1995), 
Hardle & Tsybakov (1997), among others, for estimation procedures in nonparametric AR- 
ARCH models). Relatedly, in a homoscedastic nonparametric AR model Miiller, Schick & 
Wefelmeyer (2009), who prove an asymptotic expansion of the empirical process of estimated 
innovations, mention the possibility to use their result for testing goodness-of-fit of the inno- 
vation distributions. They do not present the asymptotic distribution of the test statistics, 
nor finite sample properties. On the other hand, goodness-of-fit tests for the innovation dis- 
tribution in parametric time series are suggested by Koul & Ling (2006) in AR-ARCH models 
and by Klar, Lindner & Meintanis (2011) for GARCH models, for instance. Ducharme & 
Lafaye de Micheaux (2004) test for normality of the innovations in standard ARMA models. 

Our test statistic for Gaussianity of the innovations is the Cramer- von Mises distance of a 
weighted empirical distribution function of estimated innovations and the standard normal 
distribution. Though the mean and volatility functions are not specified in any way, the 
test statistic is shown to be asymptotically distribution-free. The test and its asymptotic 
distribution are presented in section 2A_ We treat the special cases of nonparametric AR 
and nonparametric ARCH models in detail in sections t 2?2_ and 2J3 In a small simulation 
study we demonstrate the good performance of the test for moderate sample sizes in section 
[3j We further discuss briefly how the test can be generalized to AR-ARCH models of higher 
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order or to models with additional covariates. 

As already mentioned, under Gaussianity of the innovations other testing procedures 
based on the residual empirical process will be asymptotically distribution-free. Then boot- 
strap procedures (for which asymptotic validity often is not investigated rigorously in the 
literature) can be avoided. As example for an asymptotically distribution-free specifica- 
tion test under the normality assumption we present a lack-of-fit test for standard AR(1) 
models in section |4| Further we reconsider the test for multiplicative structure by Dette, 
Pardo-Fernandez & Van Keilegom (2009) under the normality assumption. 

Finally, technical assumptions are listed in appendix |A| 



Assume we have observed X , . . . ,X n , where (X t ) te z is a real valued stationary a-mixing 
stochastic process following the AR-ARCH model of order one, i, e. 



Here the innovations e t: t & Z, are assumed independent and identically distributed with 
unknown distribution function F. Moreover, the innovations are centered with unit variance 
and e t is independent of the past X s , s < t — 1, W. 

Our aim is to test the null hypothesis H Q of standard normal innovations against a 
general alternative Hi. To this end we define kernel estimators for the conditional mean and 
conditional variance function as 



where K denotes a kernel function and c n a sequence of positive bandwidths. Technical 
assumptions are listed in the appendix. Now we estimate the innovations as residuals 



2 Main results 



2.1 AR-ARCH model 






£t = 



X-, 



\ - m(X t _i) 



and consider a weighted empirical distribution function, i. e. 



n 




(2.3) 



t=i 



as estimator for the innovation distribution. Here we define 



U7 n (A t _i) 
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while w n denotes some weight function fulfilling assumption (W) in the appendix. Selk & 
Neumeyer (2012) showed (see (A.l), (A. 3) and the arguments following in the proof of Th. 
3.1 in that paper) that under the assumptions stated in the appendix, 

m-m)(X t _ 1 ) o-(X t _i) 



F M = £ v n ,,l{s, < ^ + V^ ) \ 

1 v"^ Tr r - r s f(m — m){X t -i) (a 2 — a 2 )(X t -i)\ /n jN 

uniformly with respect to y e R, where / denotes the innovation density. Further in the 
aforementioned paper it is shown that 

E(m — m)(X t _i) 1 A . 1 , . . 



(a z -a z )(X t _i) 1 v-, 2 ,x , 1 > 
(see (A.5)-(A.7) in the cited paper). Thus, 



(2.6) 



^ E ( J & ^y} + fto* + VJ Y 1 ( £ t- !)) + M^) (2-7) 



n 

and the stochastic process 

yfr(F n (y)-F(yj), y G R, 
converges weakly to a centered Gaussian process (K(y)) yGR with 

Cov(K(j/),K(*)) = F(yAz)-F(y)F(z) 

+ f(y) (E[ £l I{ £l < z}} + yE[{e\ - l)I{e 1 < z}}) 
+ f(z) (E[ £l I{ £l < y}} + zE[{e\ - l)I{e l < y}}) 
+ f(y)f(z) {l + (y + z)E[sl\ + yz{E[e^ - 1)) . 

Now let <3> and ip denote distribution and density function of the standard normal distribu- 
tion, respectively, and denote by (G(-u)) ug [ 0i i] a centered Gaussian process with covariance 
structure 

Cov(G(u),G(u)) = u Av — uv — (/9($ _1 (-u))(/9($ _1 (f )). 
Then we have the following result for the Cramer-von Mises test statistic. 
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Theorem 2.1 Under model (2.1) and the assumptions stated in the appendix under the null 



hypothesis Hq of Gaussian innovations the test statistic 

T n = n [ (F n (y)-$(y)) 2 <p(y)dy 



converges in distribution to T = / G 2 (u) du. 

Jo 



Proof. A calculation of the covariance of IK in the case F = $ gives 

Cov(K(y),K(*)) = $(|/Az)-%)f(z) 

- 2</?(j/V0) - Zyz^(y)^(z) + tp(y)tp(z)(\ + 2yz) 
= $(y A z) - $(y)$(z) - <f(y)<f(z). 

because for e\ standard normally distributed one easily calculates E[e\I{ei < y}) = —<p(y), 
E[{e\ — l)/{ei < y}} = —y<f(y) and one has E[ef\ = 0, E[ef] = 3. From the continuous 
mapping theorem it follows that T n converges in distribution to 



K{y)Mv)dy= / (K($ -1 (w))) du 

JR JO 

while K o has the same distribution as G. This finishes the proof. □ 

Remark 2.2 It follows from Stephens (1976) that G is also the weak limit of some process 
(Y(u)) uE [o,i] such that f = f} Y 2 {u)du is the limit of 



T„ = n 



where Z\, . . . , Z n are iid with known variance r and unknown expectation fi and where $/i, T 2 
denotes the normal distribution function with expectation fi = n~ l Y^Jj=x %j an< ^ variance r 2 . 
That the limits of T n and T n coincide in distribution might be suprising because in our AR- 
ARCH model the variance is unknown and has to be estimated. However, as can be seen from 
the proof, in the asymptotic covariance of y/n(F n — $) under Hq exactly those terms cancel 



that arise from the estimation of the variance function a 2 (cf. (2.6)). Quantiles of T and thus 
critical values for T are tabled in Stephens (1976) and restated in Table [l] for convenience. 
We obtain an asymptotically distribution-free test by rejecting Hq for asymptotic level a 
whenever T n is larger than the critical value c a . Consistency can be deduced from uniform 



convergence of F n to F in probability, which follows from (2.7) 
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nominal level a 


0.15 


0.1 


0.05 


0.025 


0.01 


critical value c a 


0.118 


0.135 


0.165 


0.196 


0.237 



Table 1: Asymptotic critical values for the Cramer-von Mises test for normality of the 
innovations in the AR-ARCH model. 



Remark 2.3 The results for the nonparametric AR-ARCH model (2.1) can be extended 
to models of the form X t = m(Z t ) + a(Z t )e t where (X t , Z t ) is a stationary time series 
and Z t may include covariates as well as a finite number of past values X t -i, . . . , X t - P 
while E[e t \ J^t-i] = 0, Var(e t | Tt-i) = 1. Here Tt-\ denotes the sigma-field generated 
by Z t , (X t _i, Z t _i), (X t _ 2 , Z t _ 2 ), . . . . We conjecture that applying local polynomial esti- 
mators for m and a 2 and assuming enough smoothness of those functions the expansion 



(2.7) stays valid. A thorough treatment is beyond the scope of the paper. Note also that 
asymptotic properties of estimators for the innovation distribution in such nonparametric 
AR(p) /regression models have not yet been treated in the literature. However, in the case 
of independent observations Neumeyer & Van Keilegom (2010) showed validity of an expan- 



sion like (2.7) for the empirical distribution of residuals in multiple nonparametric regression 



models (they obtain the same expansion as in the case of one-dimensional covariates, see 



Akritas & Van Keilegom (2001)). Thus we believe that Theorem 2.1 stays valid in the more 
general model under suitable regularity conditions and the same test for Gaussianity of the 
innovation distribution can be applied. ■ 

2.2 AR model 

In this section we consider a nonparametric AR model of order one, i. e. 

X t = m(X t -i) + Vt, (2.8) 
where the innovations r] t , t G Z, are iid and centered and n t is independent of the past X s , 



s < t — 1. Thus we have model (2.1) with r\ t = oe t for the unknown (constant) variance 
a 2 = Vai(n t ). Our aim is to test the null hypothesis of normal innovations, i.e. 

H : 3a 2 > s.t. rj t ~ iV(0,a 2 ). 

The constant variance is estimated by 



V 2 = J2 V n,t( X t-™(Xt-l)) 



t=l 



where rh is the kernel estimator defined in (2.2). In this case we define the residuals as 



A _ X t - m{X^) 

£ t — ^ 

o 
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and consider F n as defined in (2.3) as estimator for the distribution of the standardized 
innovations e t . Let again F and / denote distribution and density function of e t , respectively. 
Then H is equivalent to F = $ and we have the following results. 

Lemma 2.4 Under the assumptions stated in the appendix we have the expansion 



i 71 / 

F n (y) = -J2[l{et<y} + f(y)e t + 



yf(y) 



ei-l) +0 P 



n 



t=i 



n 



t=i 



uniformly with respect to y 6 R. 

Proof. First we consider the variance estimator, for which one obtains 
1 n 

a 2 - a 2 = -)v n t (r)t - (m - m)(X 4 _i)) 2 - a 2 

^ n n n 

= o- 2 -^2v n:t (e 2 - 1) - 2a^2v nit e t (rh - m)(X t ^i) + ^v n ^ t {rh - m) 2 (X t _ 1 ) 
n t=i t 

n 1 

= a 2 22 v nA £ t - !) + °p(-?, 
t=i * • 

by Lemmata B.2 and B.3 in Selk & Neumeyer (2012). Further 

n 
t=l 

and 

1 - I 

- W n (X t -i) = 1 + P {—: 

t=i 

(see also (A. 4) in Selk & Neumeyer (2012)) and thus 

- 2 -- 2 = a 2 i^-l) + o P (4=). 

Now note that 

n 

F n(y) = ^VnjI^Et < 



(2.9) 



t=i 



(rh — m)(X t -i) o 

A ' +y- 

a a 



and with arguments analogous to the proof of Theorem 3.1 in Selk & Neumeyer (2012) which 
leads to (2.4) (see also the proof of Lemma 1 in Dette, Pardo- Fernandez & Van Keilegom 
(2009) or the proof of Theorem 3.1 in Miiller, Schick & Wefelmeyer (2009)) one obtains that 

(m - m){X t _ 1/ 

t=l 



1 n n I 

t=l t=l 



a 



+ y- 



a 2 -a 2 



2a 2 



+ o P ( 



n 



uniformly with respect to y G R. The assertion follows from (2.5) and (2.9). 



□ 
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Corollary 2.5 Under model (2.8) and the assumptions stated in the appendix under the null 
hypothesis Hq of normal innovations 

T n = n [ (F n (y)-$(y)) 2 <p(y)dy 



converges in distribution to T defined in Theorem 2. 1 



Proof. The result immediately follows from Lemma 2.4 and Theorem 2.1 



□ 



Thus using the critical values from Table [T] we obtain a completely nonparametric con- 
sistent distribution-free asymptotic level a test for Gaussianity of the innovations in AR 



models (see also Remark 2.3 which holds here analogously). 



2.3 ARCH model 

In this section we consider a nonparametric ARCH model of order one, i. e. 

X t = a{X i -i)e t , (2.10) 
where the innovations e tl t 6 Z, are iid and centered with unit variance and e t is independent 



of the past X s , s < t — 1. Thus we have model (2.1 ) with conditional mean m = 0. Our aim 
is to test the null hypothesis Hq of normal innovations. To this end let a 2 be estimated by 
the kernel estimator 

Z1 1 k(^)x? 



En jv- ( x—Xj-i 



and the residuals be defined as 



X t 



o~(X, 



t-i, 



whereas F n is as in (2.3). Let again $ denote the standard normal distribution function and 



let B denote a standard Brownian bridge on [0,1]. Then we obtain the following result. 
Theorem 2.6 Under the assumptions stated in the appendix under the null hypothesis of 



Gaussian innovations in the ARCH model (2.10) the test statistic 



converges in distribution to T 



n / (F n (y)-$(y)) 2 <p(y)dy 



(u) du. 



Proof. In the expansion (2.4) the estimation of m is not present while (2.6) stays valid 



for the new estimator a. Thus it follows that 



1 - / 

n t=i ^ 



yf(y) 



e -1) )+o P 



n 
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and the stochastic process 

yfii(F n (y)-F(yj), y e R, 
converges weakly to a centered Gaussian process (K(y)) ye R with 

Cav(K(j/),K(z)) = F(yAz)-F(y)F(z) 

+ f(y)yE[(el - l)J{ ei < z}} + f{z)zE[{e\ - l)I{e x < y}] 
+ f(y)f(z)yz(E[ej]-l). 

As the innovations are standard normally distributed we have E[(ef — l)I{ei < z}} = —ztp(z), 
f = ip and E[ef\ = 3. Hence, 

Cav(K(y),K(z)) = $(y A z) - Q(y)${z). 

The assertion follows by the continuous mapping theorem noting that IK o $ _1 is a standard 
Brownian bridge. □ 

For convenient reference we state the critical values for the test in Table |2j Here c a is 
the (1 - a)-quantile of T = M 2 (u) du, see Shorack & Wellner (1986), p. 147. 



nominal level a 


0.15 


0.1 


0.05 


0.02 


0.01 


critical value c a 


0.284 


0.347 


0.461 


0.6198 


0.743 



Table 2: Asymptotic critical values for the Cramer-von Mises test for normality of the 
innovations in the ARCH model. 



3 Simulations 

To examine the performance of the test on small samples we consider AR(1) models and 
ARCH(l) models, for which we compare the results under the assumption of an AR-ARCH 



model like (2.1) and under the assumption of an AR model like (2.8) (respectively ARCH 



like (2.10ft). 



For the AR(1) case we consider the models 

X t = 0.5-Xt-i +e t , St,..., e n ~ F c , 
where F$ denotes the skew-normal distribution with location parameter 



2vr ((5C) 2 + (5C) 4 ) 



vr 2 + (2vr 2 - 2tt) • (5C) + (tt 2 - 2vr) • (5() 



i ■ 
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scale parameter (ir(l + (5C) 2 ) 1 / 2 / '(it + (vr — 2)(5C) 2 ) 1 / 2 and shape parameter 5( for different 
values of (. For ( = this is the standard normal distribution. The rejection probabilities for 
500 repetitions and level 5% are displayed in Tableland Figure [T] for the AR-ARCH model 



(2.1 ) and the AR model (2.8) respectively. It can be seen that the level is approximated well 



and the power increases for increasing parameter ( as well as for increasing sample size n. 



% 


C = o 


C = 0.1 


C = 0.2 


C = 0.3 


( = 0.4 


C = 0.6 


C = 0.8 


C = i 


AR-ARCH 


















n = 100 


4.8 


5.4 


8.6 


12.2 


26.2 


48.8 


62 


75.4 


n = 200 


5 


6 


8.8 


18.6 


44 


83.6 


94 


96.2 


AR 


















n = 100 


5 


7.4 


9 


15.6 


27 


53.2 


67.4 


76.6 


n = 200 


5 


6.8 


8.8 


19 


41.8 


77.8 


93.6 


98 



Table 3: Rejection probabilities obtained from AR(1 ) models with skew-normally distributed 
innovations 




Figure 1: Rejection probabilities obtained from AR(1) models with skew-normally dis- 
tributed innovations for n = 100 (dashed curve) and n = 200 (solid curve). On the left 
panel the results for the AR-ARCH model are shown and on the right the ones for the AR 
model. 



We also examine ARCH(l) models with the same innovation distribution, 



X t = ^0.75 + 0.25Xta • e u e u . . . , e n ~ F c 

for different values of (. The rejection probabilities for 500 repetitions and level 5% are 
shown in Tableland Figure [2] for the AR-ARCH model flO and the ARCH model (|2!6|) 
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respectively. The asymptotic level is approximated well and the power increases with in- 
creasing ( as well as with increasing n. For the ARCH model the increase with £ for small n 
is not as pronounced as for the models considered before, therefore we additionally examined 
this model with sample size n = 500 for which an increase comparable to those before can 
be observed. 



% 


C = o 


C = o.i 


C = 0.2 


C = 0.3 


C = 0.4 


C = 0.6 


C = 0.8 


C = i 


AR-ARCH 


















n = 100 


5.2 


6.6 


8.2 


15.8 


33 


61.8 


72.8 


82.6 


n = 200 


4.8 


6.2 


7.8 


22 


47.4 


82.2 


94.8 


98.8 


ARCH 


















n = 100 


5 


6.2 


6.4 


8 


16.2 


24.4 


31.2 


39.6 


n = 200 


5.2 


6 


7.2 


10.4 


15 


34 


48.8 


55.8 


n = 500 


5 


5.6 


6.8 


14.2 


31.8 


68.6 


87 


91.2 



Table 4: Rejection probabilities obtained from ARCH(l) models with skew-normally dis- 
tributed innovations 




Figure 2: Rejection probabilities obtained from ARCH(l) models with skew-normally dis- 
tributed innovations for n = 100 (dashed curve), n = 200 (solid curve) and n = 500 (dotted 
curve). On the left panel the results for the AR-ARCH model are shown and on the right 
panel the ones for the ARCH model. 



To further study the power of the testing procedure, we examine the same models with 
Student-t distributed innovations with different degrees of freedom. Due to the fact that 
Var(e t ) has to be one for all t, the Student-t distribution was standardized. The models are 
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AR(1) 



X t = 0.5 • X t -i + £ t , £!,...,£„ ~ St(() 



and ARCH(l) 



X t = J0.75 + 0.25A7_! • e t , e u 



for different values of (. The rejection probabilities for 500 repetitions and level 5% are 
displayed in Table [51 Figure M (AR(1) model) and Figure 111 (ARCH(l) model). As before 



we compare the results under the assumption of an AR-ARCH model like (2.1) and under 



the assumption of an AR model like (2.8) (respectively ARCH like (2.10)). It can be seen 
that the power is good and increases for increasing sample size n while it decreases for 
increasing parameter (, because the Student-t distribution converges to the standard normal 
distribution for increasing degree of freedom. 



% 


C = 3 


C = 4 


C = 5 


C = 6 


C = 7 


C = 3 


C = 4 


C = 5 


C = 6 


C = 7 


AR-ARCH 






















n = 100 


61.4 


49.8 


35.2 


27.8 


23.6 


60.4 


45.2 


35.8 


26.4 


22.6 


n = 200 


90 


71.8 


52.2 


45,4 


34.6 


91 


72.2 


52.4 


37.2 


31 


n = 500 


100 


98.8 


89.6 


77.6 


62 


100 


98.4 


86.6 


75 


59.2 


AR/ARCH 






















n = 100 


23.4 


15.8 


15.4 


11 


9.8 


18 


9.8 


7.6 


7.2 


5.2 


n = 200 


54.8 


25.6 


10.8 


9.8 


6.4 


47.2 


22.4 


12 


6.6 


6.2 


n = 500 


99 


74.6 


41.8 


22.4 


18.6 


96.8 


70.2 


37 


19.6 


13.4 



Table 5: Rejection probabilities obtained from AR(1) models (left) and ARCH(l) models 
(right) with St(() distributed innovations 



Simulation setting. For each simulation 10 • n observations X t were generated. For 
the test the last n observations were used. This was done to ensure that the process is in 
balance. 

The empirical processes were built with weight function w n = /[-i og (n),iog(n)]- The Nadaraya- 
Watson estimators m and o were calculated with Gaussian kernel. This is not compatible 
with all assumptions, e. g. the support of the kernel is not compact. However this has 
negligible effect on the simulations because the Gaussian kernel decreases exponentially fast 
at the tails. The bandwidth was chosen according to the assumptions by a rule of thumb as 
b 2 n^. with a 2 = J2t=i v nA X t - rh{Xt-i)f- 
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degree of freedom zeta 



degree of freedom zeta 



Figure 3: Rejection probabilities obtained from AR(1) models with St(() distributed inno- 
vations for n = 100 (dashed curve), n = 200 (solid curve) and n = 500 (dotted curve). On 
the left panel the results for the AR-ARCH model are shown and on the right panel the ones 
for the AR model. 




Figure 4: Rejection probabilities obtained from ARCH(l) models with St(Q distributed 
innovations for n = 100 (dashed curve), n = 200 (solid curve) and n = 500 (dotted curve). 
On the left panel the results for the AR-ARCH model are shown and on the right panel the 
ones for the ARCH model. 



4 Further examples 
4.1 Testing for linear AR(1) 

As was mentioned in the introduction other distribution-free specification tests for the AR- 
ARCH model can be derived once the Gaussianity of the errors has been established. In this 
section we will study in detail a lack-of-fit test for the linear AR(1) model. For the method 
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compare Van Keilegom, Gonzalez Manteiga, & Sanchez Sellero (2008) in a nonparametric 
regression model with independent observations. Similarly one can derive tests, e.g., for 
parametric ARCH models. 

For simplicity here we assume that i£[Xo] = as this is a typical assumption in AR 
models. We consider the model 

X t = m(X t - 1 )+ae t , (4.1) 

where the innovations e t , t G Z, are iid standard normal and a is an unknown positive 
constant. Further, e t is independent of the past X s , s < t — 1. Our aim is to test the null 
hypothesis 

Ho '■ 3$ G (-1, 1) s. t. m(x) = fix. 

See Hong-zhi & Bing (1991) or Koul & Stute (1999) for other procedures to test for H . 

Let $ denote any (under H ) y^-consistent estimator for e. g. the Yule- Walker or 
ordinary least squares estimator under suitable regularity assumptions. Now define residuals 
under the null as 

x t - dx t ^ 

S ,t - T 

with a 2 = n~ l J2ti( X t ~ ^-i) 2 , and 



t=l 



Let further F n be defined as in section 2.2 Then we have the following result. 



Theorem 4.1 Under the assumptions stated in the appendix for model (4-1) with Gaussian 
innovations we have under H that 

T n = 2^n [ {F n {y) - F , n (y)) 2 dy 

JR 

converges in distribution to a xf- distributed random variable. 

Proof. Let $o denote the 'true' parameter under Hq. Similar to the proof of Lemma 2A 
we have for standard Gaussian e t that 



n 

t=l t=l 
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uniformly with respect to y 6 R. Note that for the a 2 defined here the last equality in (2.6) 
also holds. Further, 



1 n 

-Vx t _! = E[X ] + o P (l) = o P (l) 

n < ' 



n 
t=l 



because (X t ) te z is ergodic due to its mixing property (see e.g. Doukhan (1994)). Thus with 
the i/n-consistency of $ we obtain 



1 n 
' t=l 



£ -i +M , 

n 



uniformly with respect to j/ 6 R. Now from Lemma 2.4 we have 



1 " 

V^(F n (y) - h, n (y)) = v(y)^y2^t + o P (i) 



which converges in distribution to <p(y)Z with a standard normally distributed Z. Thus T n 
converges in distribution to 

2 A / &(y)) 2 dyZ 2 = Z 2 , 

which is Xi"distributed. □ 

An asymptotically distribution-free level-a test is obtained by rejecting the null hypoth- 
esis H of a standard AR(l)-model whenever T n is larger than the (1 — a)-quantile of the 
xf-distribution. 



4.2 Testing for multiplicative structure 

Under the assumption of Gaussian innovations the test for multiplicative models by Dette, 
Par do- Fernandez & Van Keilegom (2009) simplifies. Here the null hypothesis to be tested 



for model (2.1 ) is 

Hn '■ 3c s. t. m — ccr. 



This condition connects ARCH models of the form Z t = s(Z t _i)e t to model (2.1) by setting 
X t = Z 2 = m(X t _i) + <r(Xt-i)e t with e t = e 2 - 1 and c = {E[e\} - l)" 1 / 2 . 

Note that under Hq the constant c can be estimated -y/n-consistently by a least-squares 
estimator c defined by Dette, Pardo- Fernandez & Van Keilegom (2009). Its asymptotic 
expansion under Hq and under our assumptions is 

= „ £ ( - 2 C£ ' + £ < + 2 C ) + 0p( vs 1, (4 ' 2) 
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see Theorem 5 in the aforementioned paper, but note that under our assumptions given in 
the appendix the influence of the weight function vanishes asymptotically. Now define 



F , n (y) = ^v n! tl{e ,t<y} 
t=\ 

X t - ca{X t _ x ) 



with residuals 



a 



(X; 



t-l, 



where a is as in (2.2). Let for k e {4, 8}, 

n 

Sk = ^Vn^iXt-x) 



4=1 



and f 2 = Sg/sf — 1. Finally, let F n be defined as in (2.3). Then we have the following result. 



Theorem 4.2 Under the assumptions stated in the appendix for model (2.1) with Gaussian 
innovations we have under H that 



T 



2^n f R (F n (y) - F , n (y)) 2 dy 



(fc 2 + l) 2 f 2 



converges in distribution a xt' distributed random variable. 

Proof. We only sketch the main differences to the proof in Dette, Pardo-Fernandez & 
Van Keilegom (2009) due to slightly different assumptions (under which in particular the 
influence of the weight function is asymptotically negligible). We have 



n 

F ,n(y) = ^2 V n,tl\e t < 
4=1 



(ca - ca)(X t -i) <7pQ_] 

— + y- 



n 



1 Tr / x / (ccr — ca)X t -i 

-E>^<y}^(.y)E'M [ a{Xl l) +y 

4=1 4=1 K 1 1J 



a 2 (X t ^) - a\XU, 



2a 2 {X, 



t-i, 



y/n 



uniformly with respect to y G K (compare to (2.4)). From this and (4.2), (2.4), (2.6) one 
obtains 

V^(F n (y) - F M) = <p(y)± f; (lc(e 2 t - 1) - e t ) (l - + o P (l) 

which by Th. 2.21 in Fan & Yao (2003) converges in distribution to ip(y)Z, where Z is 
centered normally distributed with variance (|c 2 + l)r 2 with 



r 2 = E 



a\X, 



t-i, 



E[a\X )} 



E[a 8 (X )} 
(E[^(X )]y 



1. 
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Finally the assertion follows because f 2 consistently estimates r 2 because a consistently 
estimates a and (a(X t )) te z inherits the mixing property of (X t ) te z and is therefore ergodic 
as well. Thus T n converges in distribution to 

C 7 2 7 2 

2>/* (<p(v)) 2 dv ' 



(| C 2 + 1) T 2 (f C 2 + l) T 2' 

which is Xi-distributed. □ 

One obtains an asymptotically distribution-free test for Ho and thus avoids to implement 
bootstrap procedures. Note that it is not obvious which kind of bootstrap procedure should 
be applied here in the context of arbitrary innovation distributions. 

A Technical assumptions 

The assumptions are similar to those in Selk & Neumeyer (2012) and required for their 
Theorem 3.1 which we use. 

(K) The kernel K is a three times differentiable density with compact support [— C, C] and 
su P«e[-c,c] \K M (u)\ < K < oo, fi = 0, 1,2,3. Moreover K(C) = K{-C) = K'(C) = 
K'(-C) = and / K{u)udu = 0. 

(C) The sequence of bandwidths c n fulfills 

nctttogny ^ 0, ( logn ^ _» o for all V > 0. 

(I) For the interval /„ = [a n ,6 n ] some Ti < oo exists, such that (b n — a n ) = 0(\og(n) ri ). 
Moreover ( f^ K fx Q (x)dx + f£_ K fx (x)dx) = o((logn)~ 1 ), where f Xo denotes the 
density of Xo- 

(W) The weight function w n : R — > [0, 1] fulfills w n (x) = 1 for x G [a n + k, b n — k] and 
w n (x) = for x [a n , b n ] for some n > independent of n and is three times differen- 
tiable such that sup neN sup^jj \ w^\x)\ < oo for fi = 1, 2, 3. 

(F) The innovations Ej, j G Z, are independent and standard normally distributed. 

(E) Some b > 1 + V3 exists such that E [|X | 26 ] < oo. 

(X) The observation process (Xj)j e z is a-mixing with mixing-coefficient a(n) = O^n" 13 ) for 
some 

,' (3 + V3)b + 2 + V3 _ 
> max | 2 — — , 7 

^ + y/3)b - 2(2 + y/3) 
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Their density fx is bounded and four times differentiable with bounded derivatives. 
The density is also bounded away from zero on compact intervals and some r/ < oo 
exists, such that =-= — — t — = 0((\ogn) r f). 

(Z) It holds that 

sup ((\m(x)\ + \a(x)\) 2k f Xo (x)) =0(1) 
and there exists some 1 < j* < oo such that 

sup ((Im^l + I^IJ^K^I + KaODV^-xC^aO) =0(1) 

_1 _1 1 

is valid for all j > j*+l, for = 1, 2, n — oo with J„ = [a n — (0+c n 2 n 2 (logn) z)c n , 6 n + 

— i _ 1 1 

(0 + c„ 2 n 2 (logn)2)c n ]. 

(M) The regression function m and the scale function a are four times differentiable and 
there exist some r q ,r s < oo and q n , q° with snp x&[an _ CcnK+Ccn] \m^(x)\ = 0(q n ), 
snp xe[an _ Ccn:bn+Ccn] \<tM(x)\ = 0(q n ), fi = 0,1,2,3,4 and .^J ^ = 0(<&), where 
q n = 0((logn)^), ql = 0((logn) r -), (g™)" 1 = 0(1), (g^)" 1 = 0(1). 

Remark A.l Note that the mixing condition in (X) is weaker than the one in Selk & 
Neumeyer (2012). This is due to the nonsequential case that is examined here for which the 
proof of Lemma B.3 in the aforementioned paper can be simplified. Further note that the 
assumptions above are formulated under the null hypothesis H of Gaussian innovations. To 
obtain consistency of the testing procedures one needs to replace assumption (F) by 

(F') The innovations Ej, j G Z, are independent and identically distributed with distribution 
function F. Their density / is continuously differentiable and sup 4gR |/(t)i| < oo as 
well as sup tgR \ f'(t)t 2 \ < oo. Further, E \\ei\ 2b ~\ < oo for b from assumption (E). 



For AR model (2.8) with rjt = cre t some conditions in (Z) and (M) are redundant because a 
is a constant function. A similar remark holds for the ARCH model (2.10) where m = 0. ■ 
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